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The nucleic acid sequence of the full-length, chemically inducible Arabidopsis PR-1 promoter has been discovered and is disclosed 
herein. Furthermore, ciy-acting regulatory elements in the Arabidopsis VR-X promoter involved in chemical induction have been characterized 
using deletion and Tinker-scanning mutagenesis and in vivo footprinting. It has been discovered that at least a portion of the region of 
promoter between positions -698 and -621 (relative to the transcription start site of the PR- 1 gene) is required for induction of gene 
expression by chemicals. Two 10-bp linker-scanning mutations centered at 640-bp and 610-bp upstream from the transcription start site 
abolish the inducibility of the promoter while another 10-bp mutation centered at -670 bp results in average induced expression levels 4-fold 
higher than the unmutated promoter. Additionally, inducible in vivo footprints are located at positions -629 and -628 and at position -604 
on the coding strand and at position -641 on the non-coding strand. The use of chemically inducible Arabidopsis PR-1 promoter fragments 
to regulate gene expression in plants in the presence of inducing chemicals such as SA, INA, and BTH is disclosed, as well as the use 
of these elements for the isolation of transcriptional regulatory proteins involved in the promoter regulation and for the construction of 
inducible hybrid promoters. 
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CHEMICALLY-INDUCIBLE/IA4£/Z>0/»S/S PR-1 PROMOTER 

This application claims the benefit of U.S. Provisional Application No. 60/027,228, 
filed July 23. 1996, the disclosure of which is hereby expressly incorporated by reference in 
its entirety into the instant disclosure. 



FIELD OF THE INVENTION 

The invention generally relates to non-coding DNA sequences that regulate the 
transcription of associated DNA sequences in the presence of chemical regulators. The 
invention more particularly relates to the Arabidopsis PR-1 promoter, as well as deletion and 
linker-scanning mutants thereof, and its use in regulating gene expression in plants in the 
presence of chemical regulators. The invention further relates to sequences in the PR-1 
promoter that are necessary for induction by chemicals and that are involved in its inducibility 
as well as their use for the isolation of transcriptional regulatory proteins and for the 
construction of inducible hybrid promoters. 

BACKGROUND OF THE INVENTION 

Advances in recombinant DNA technology coupled with advances in plant 
transformation and regeneration technology have made it possible to introduce new genetic 
material into plant cells, plants, or plant tissue. The target plants can range from trees and 
shrubs to ornamental flowers and field crops and even to cultures of plant tissue grown in 
bioreactors. 

Regardless of the target of genetic engineering of plants, a principal advantage to be 
realized is the controlled expression of chimeric genes so that they are expressed only at the 
appropriate time, to the appropriate extent, and in some situations in particular parts of the 
plant. For example, the energy expended by a plant to continuously produce high levels of a 
foreign protein could prove detrimental to the plant, whereas if the gene were expressed only 
when desired, the drain on energy and therefore yield could be reduced. Additionally, the 
phenotype expressed by the chimeric gene could result in adverse effects to the plant if 
expressed at inappropriate times during development. For tissue in culture or in a bioreactor, 
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the untimely production of a desired secondary product could lead to a decrease in the growth 
rate of the culture, resulting in a decrease in the yield of the product. 

In view of such considerations, it is apparent that control of the time, extent, and/or site 
of expression of chimeric genes in plants or plant tissues would be highly desirable. An ideal 
situation would be the at-will regulation of expression of an engineered trait via a regulating 
intermediate that could be easily applied to field crops, ornamental shrubs, bioreactors, etc. 
Such gene control could be of particularly great commercial value. 

Several plant genes are known to be induced by various internal and external factors 
including plant hormones, heat shock, chemicals, pathogens, lack of oxygen, and light. For 
example, systemic acquired resistance (SAR) is a plant defense mechanism that occurs in 
diverse plant/pathogen interactions following a primary pathogen infection (Ryals, et aL, 
Plant Cell 8, 1809-1819 (1996), incorporated by reference herein in its entirety). It often 
leads to a hypersensitive response associated with the formation of necrotic lesions and to a 
substantial increase of the endogenous salicylic acid (SA) level. As a result, the plant 
becomes resistant to a variety of pathogens through a non-specific "immunization". This 
phenomenon is tighly correlated with the expression of several classes of genes, called 
pathogenesis-related (PR) genes (Ward, et al., Plant Cell 3, 1085-1094 (1991), incorporated 
by reference herein in its entirety). Interestingly, exogenous application of SA also induces 
SAR and expression of PR genes (Ward, et al. 1991; Uknes, et al., Plant Cell 4, 645-656 
(1992), incorporated by reference herein in its entirety) as well as of synthetic compounds, 
such as 2,6-dichloroisonicotinic acid (INA) (Vemooij, et aL, MPMI 8, 228-234 (1995) 
incorporated by reference herein in its entirety) and benzo(l,2,3)thiadiazole-7-carbothioic 
acid S-methyl ester (BTH) (Friedrich, et aL, Plant J. 10(1 ), 61-70 (1996), incorporated by 
reference herein in its entirety; Lawton, et aL, Plant J. 10, 71-82 (1996), incorporated by 
reference herein in its entirety). Therefore, induction of PR protein genes by chemicals or 
pathogens provides a foundation to address the problem of controlling gene expression in 

plants and plant tissue. 

Several steps in the signal transduction leading to the onset of SAR have been 
elucidated. Overexpression of a bacterial salicylate hydroxylase gene (nahG) has been shown 
to suppress SAR, indicating that SA accumulation is required for its onset (Gaffney, et aL, 
Science 261, 754-756 (1993), incorporated by reference herein in its entirety). Recently, SAR 
has been characterised in Arabidopsis and the corresponding PR- 1 , PR-2 and PR-5 genes 
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have been isolated (Uknes, et al. 1992), thus facilitating the Isolation of mutants with altered 
SAR. Mutants with constitutive SAR such as cprl (Bowling, et al.. Plant Cell 6. 1845-1857 
(1994)) have been described, as well as mutants defective in SAR such as nprl (Cao, et al. 

1994) and niml (Delaney, ex al., Proc. Nat. Acad. Sci. 92, 6602-6606 (1995). incorporated by 
reference herein in its entirety). In the case of nim 1 , a mutagenised population of A rabidopsis 
was screened for lack of PR- 1 gene expression after pathogen treatment (Delaney, et al. 

1995) . NIM I and NPR1 were subsequently shown to include mutations in the same open 
reading frame (Cao, et al. 1997; Ryals, etal., Plant Cell 9, 425-439 (1997), incorporated by 
reference herein in its entirety) that coded for a protein with extensive homology to the 
transcriptional repressor I-KB (Ryals, et al. 1997). 

Studies of the effects of chemical regulators on the promoters of several PR genes have 
also been described in the literature, shedding additional light of the SAR signal transduction 
pathway. Deletion analysis of the tobacco PR-1 a promoter revealed that a 600-bp long 
promoter had lost its functional inducibility by pathogen infection as well by exogenous 
chemical application while 661 -bp retained inducibility although to a lesser extent than a 903- 
bp long fragment (Uknes, et al., Mol. Plant-Microbe Interact. 6, 692-698 (1993), 
incorporated by reference herein in its entirety). Analysis of the tobacco PR-2d promoter 
revealed that some of its inducility is lost in a 607-bp long fragment but that 1047-b P were 
required for maximal induction, while a 321-bp long promoter had lost almost any 
inducibility (Hennig, et al. 1993). More recently, a Myb-like transcrition factor (myb I ) was 
isolated and its expression shown to be inducible by SA and tobacco mosaic virus (Yang, ex 
al. 1996). Furthermore it was shown to bind in vitro to a fragment of the tobacco PR- la 
promoter (positions -679 to -487 from the transcription start site) containing a Myb-like 
recognition site (positions -520 to -514). Moreover, a sequence in the tobacco PR-2d 
promoter (-348 to -324) was shown to bind in vitro to a yet unidentified protein. Mutation of 
this sequence reduced inducibility by SA by approximately 3-fold but not completely 
compared to wild-type sequence when included in a fragment spanning positions -364 to -288 
in the PR-2d promoter and fused to the -90 35S CaMV promoter (Shah, et al. 1 996). 

United States Patent No. 5,614,395, incorporated by reference herein in its entirety, 
describes the Arabidopsis PR-1 protein gene and its chemically inducible promoter. As 
described in this patent, the cDNA sequence encoding this PR protein was cloned into 
plasmid P APRlC-l, ATCC number 75049, deposited July 12, 1991. In addition, it was 



WO 98/03536 



PCT/US97/12626 



disclosed that the full-length promoter fragment (4.2 kb) from the PR- 1 coding sequence was 
isolated and cloned into plasmid pAtPRl-P, which was deposited January 5, 1994, with the 
Agricultural Research Culture Collection. International Depositing Authority and assigned 
NRRL number NRRL B-21 1 69. It was further disclosed in this patent that the full-length 
Arabidopsis PR-1 promoter fragment was fused to the firefly luciferase (LUC) gene and 
ultimately cloned into plasmid pAtPRl-S, which was in turn transformed into Arabidopsis 
plants for chemical induction analysis. The transgenic Arabidopsis lines carrying the PR-1 
promotcr/LUC gene fusion were then treated by spraying with isonicotinic acid (INA). When 
analyzed, the transgenic lines showed significantly higher induction of luciferase activity 
compared to water-treated controls. Thus, INA was shown to induce expression in 
transformed plants of a chimeric gene comprising the full-length Arabidopsis PR-1 promoter 
fragment. 

As described in U.S. Patent No. 5,614,395, only a PR-1 promoter/LUC gene fusion 
comprising the full-length (4.2 kb) PR-1 promoter was analyzed. In addition, the nucleotide 
sequence of the Arabidopsis PR- 1 promoter was not disclosed. However, it is often the case 
that smaller DNA fragments are easier to manipulate than larger DNA fragments. In this 
instance, by reducing the PR-1 promoter sequence to its minimal essential portion, creating a 
smaller DNA fragment, chimeric constructs comprising the minimal promoter fragment 
joined to a coding sequence could in certain situations be more easily made and utilized than 
constructs comprising a full-length promoter fragment. 

Therefore, one object of the instant invention is to determine the nucleotide sequence 
of the chemically regulable Arabidopsis PR- 1 promoter. Based on this knowledge, 
additional objects of the instant invention are to determine the minimal length of the 
Arabidopsis PR-1 promoter required for chemical induction by using deletion mutagenesis 
and to characterize cis-acting regulatory elements in the Arabidopsis PR-1 promoter involved 
in chemical induction by using linker-scanning mutagenesis and in-vivo footprinting. 

SUMMARY OF THE INVENTION 

The present invention provides the nucleic acid sequence of the full-length 
Arabidopsis PR-1 promoter, which is shown in SEQ ID NO: 1. In addition, the present 
invention encompasses chimeric genes comprising the Arabidopsis PR-1 promoter 
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The invention therefore describes important regulatory elements involved in the 
chemical induction of the PR-1 promoter. These elements can be used for the isolation of 
transcriptional regulatory proteins involved in the promoter regulation and for the 
construction of inducible hybrid promoters. These elements can further be used as probes for 
other chemically inducible promoters from Arabidopsis as well as chemically inducible 
promoters from other plants. 



FIG. 1 depicts the structure of linker-scanning mutants LS1-LS 13. The wt sequence is 
taken from SEQ ID NO: 1 . Nucleotides that are actually changed are shown in bold. 

BRIEF DESCRIPTION OF THE SEQUENCES IN THE SEQUENCE LISTING 

SEQ ID NO: 1 - Nucleic acid sequence of the full-length (4 T 258-bp long) Arabidopsis 
thaliana PR-1 promoter (in plasmid pLTDID). Also shown are the nucleic acid sequence of 
the 5' untranslated region as well as the start of the coding region for the PR-1 gene. 
Nucleotides 3444 - 4258 constitute the 815-bp long Arabidopsis thaliana PR-1 promoter 
fragment (in plasmid pLTD7D) that confers essentially the same levels of chemical induction 
of gene expression as the full-length (4,258-bp long) promoter. Nucleotides 3561 - 4258 
constitute the 698-bp long Arabidopsis thaliana PR-1 promoter fragment (in plasmid 
pLTD7)D) that also confers chemical induction of gene expression, although to levels 
reduced by approximately 3-4 fold as compared to the full-length promoter. Nucleotides 
3638 - 4258 constitute the 621 -bp long Arabidopsis thaliana PR-1 promoter fragment (in 
plasmid pLTD72D) that confers no chemical induction of gene expression. 

SEQ ID NO: 2 - primer extl . 

SEQ ID NO: 3 - primer 1 a. 

SEQ ID NO: 4 - primer 1 b. 

SEQ ID NO: 5 - primer lc. 

SEQ ID NO: 6 - primer Id. 

SEQ ID NO: 7 - primer N 1,076. 

SEQ ID NO: 8 - primer PR1R. 



BRIEF DESCRIPTION OF THE FIGURE 
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SEQ ID NO: 9 - primer N78 1 . 
SEQ ID NO: 1 0 - primer N704 
SEQ ID NO: 1 1 - primer LSI-. 
SEQ ID NO: 12 -primer LSI +. 
.5 SEQ ID NO: 13 -primer LS2-. 

SEQ ID NO: 14 - primer LS3-. 
SEQ ID NO: 15 - primer LS4-. 
SEQ ID NO: 1 6 - primer LS5-. 
SEQ ID NO: 1 7 - primer LS6-. 
•° SEQ ID NO: 18 -primer LS7-. 

) SEQ ID NO: 19 -primer LS8-. 

SEQ ID NO: 20 - primer LS9-. 
SEQ ID NO: 2 1 -primer LS2+. 
SEQ ID NO: 22 - primer LS3+. 
" SEQ ID NO: 23 - primer LS4+. 

SEQ ID NO: 24 - primer LS5+. 
SEQ ID NO: 25 - primer LS6+. 
SEQ ID NO: 26 - primer LS7+. 
SEQ ID NO: 27 - primer LS8+. 
20 SEQ ID NO: 28 - primer LS9+. 

SEQ ID NO: 29 - primer LS 10-. 
SEQ ID NO: 30 - primer LS 1 1 -. 
SEQ ID NO: 3 1 - primer LS 1 2-. 
SEQ ID NO: 32 - primer LS 1 02. 
" SEQ ID NO: 33 - primer LS 1 03. 

SEQ ID NO: 34 -primer LSI 12. 
SEQ ID NO: 35 - primer LSI 13. 
SEQ ID NO: 36 - primer LS 1 22. 
SEQ ID NO: 37 - primer LS 123. 
™ SEQ ID NO: 38 - primer LS 1 3-. 

SEQ ID NO: 39 - primer PI - 
SEQ ID NO: 40 - primer LMPCR2 
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SEQ ID NO: 4 1 - primer LMPCR3 
SEQ ID NO: 42 - primer P2- 
SEQ ID NO: 43 - primer P3- 
SEQ ID NO: 44 - primer P41+ 
SEQ ID NO: 45 - primer LMPCR1 
SEQ ID NO: 46 - primer P52+ 
SEQ ID NO: 47 - primer P53+ 

SEQ ID NO: 48 - recognition site of the yeast transcription factor GCN4 
SEQ ID NO: 49 - recognition site of the bZIP transcription factor CREB 
SEQ ID NO: 50 - recognition site for bZIP transcription factors 
SEQ ID NO: 5 1 - consensus recognition sequence of NF-kB 

DETAILED DESCRIPTION OF THE INVENTION 



is 



20 



25 I 



30 



The present invention encompasses various aspects of the Arabidopsis PR-1 promoter, 
including the discovery that a particular region is required for its induction by chemicals. The 
full-length Arabidopsis thaliana PR-1 promoter was originally isolated as a fragment having 
a length of 4,258-bp relative to the transcription start site of the PR- 1 gene. The nucleic acid 
sequence of the full-length promoter has been discovered and is shown in SEQ ID NO: 1. 

In addition, it has been discovered that at least a portion of the region of the 
Arabidopsis thaliana PR-1 promoter located between 698-bp and 62 1 -bp upstream from the 
transcription start site (between nucleotides 356 1 and 3638 of SEQ ID NO: 1 ) is required for 
chemical induction of gene expression. The present invention therefore encompasses an 
isolated DNA molecule that constitutes the full-length PR-1 promoter sequence, as well as 
solated DNA molecules that constitute relatively minimal PR-1 promoter sequences but still 
include the necessary region between nucleotides 3561 and 3638. These promoter sequences 
can be operatively linked to a coding sequence to form a chimeric gene, whereupon the 
promoter sequence will regulate transcription of the coding sequence. The chimeric gene can 
be cloned into a recombinant vector, which can then in tum be stably transformed into a host. 
The transformed host will then exhibit expression of the chimeric gene upon treatment with a 
chemical regulator. Thus, the present invention also encompasses chimeric genes comprising 
either the full-length PR- 1 promoter or one of the chemically inducible PR-1 promoter 
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transformation are those useful for transformation of plant cells or of Agrobacteria. For 
Agrobacterium-medizted transformation, the preferred vector is a Ti-plasmid derived vector. 
For direct DNA transfer into protoplasts, any of the aforementioned vectors may be used. 
Various chemical regulators may be employed to induce expression of the coding 
5 sequence in the chimeric genes constructed according to the present invention. In the context 
of the instant disclosure, "chemical regulators" include chemicals known to be inducers for 
PR proteins in plants, or close derivatives thereof. These include benzoic acid, salicylic acid 
(SA), polyacrylic acid and substituted derivatives thereof; suitable substituents include lower 
alkyl, lower alkoxy, lower alkylthio, and halogen. An additional group of regulators for the 
10 chemically inducible promoters sequences and chimeric genes of this invention is based on 
the benzo-l,2,3-thiadiazole (BTH) structure and includes, but is not limited to, the following 
types of compounds: benzo-l,2,3-thiadiazolecarboxylic acid, benzo- 1,2,3- 
thiadiazolethiocarboxylic acid, cyanobenzo-l,2,3-thiadiazole, benzo- 1,2,3- 
thiadiazolecarboxylic acid amide, benzo- 1 ,2,3-thiadiazolecarboxylic acid hydrazide, benzo- 
ls 1 ,2.3-thiadiazole-7-carboxylic acid, benzo- 1 ,2,3-thiadiazole-7-thiocarboxylic acid, 7-cyano- 
benzo- 1,2,3-thiadiazole, benzo- l,2,3-thiadiazole-7-carboxylic acid amide, benzo- 1,2,3- 
thiadiazole-7-carboxylic acid hydrazide, alkyl benzo- 1,2,3-thiadiazoIecarboxy late in which 
the alkyl group contains one to six carbon atoms, methyl benzo- l,2,3-thiadiazole-7- 
carboxylate, n-propyl benzo- l,2,3-thiadiazole-7-carboxylate, benzyl benzo- 1 ,2,3-thiadiazole- 
20 7-carboxylate, benzo- 1 ,2,3-thiadiazole-7-carboxylic acid sec-butylhydrazide, and suitable 
derivatives thereof. Still another group of regulators for the chemically inducible DNA 
sequences of this invention is based on the pyridine carboxylic acid structure, such as the 
) isonicotinic acid structure and preferably the haloisonicotinic acid structure. Preferred are 

dichloroisonicotinic acids and derivatives thereof, for example the lower alkyl esters. Suitable 
25 regulators of this class of compounds are, for example, 2,6-dichIoroisonicotinic acid (INA), 
and the lower alkyl esters thereof, especially the methyl ester. 

The chimeric genes constructed according to the present invention may be 
transformed into any suitable host cell; however, the chimeric genes are preferably 
transformed into plant tissue. As used in conjunction with the present invention, the term 
30 "plant tissue" includes, but is not limited to, whole plants, plant cells, plant organs, plant 

seeds, protoplasts, callus, cell cultures, and any groups of plant cells organized into structural 
and/or functional units. Plants transformed with the chimeric genes of the present invention 
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expression could be controlled. See. e.g., Uknes, Plant Cell, 4: 645-656 (1992); Ward et ai. 
Plant Cell 5:1085-1094 (1991); Gould. Bioscience 38: 26-33 (1988); and Gould. TIBTECH 
6: S 1 5-S 1 8 ( 1 988). Also, the chemical regulation of developmental processes such as 
homeosis. germination, tillering, sprouting, flowering, anthesis, fruit ripening, and abscission 
offers several advantages such as the facilitated production of hybrid seed, greater reduction 
of crop loss, and more generally, control of the growth and development of the plant by the 
farmer. Thus, the present invention applies equally to transgenic plants containing 
heterologous genes, e.g., disease resistance genes including PR and SAR genes, insect 
resistance genes such as BT genes, and genes involved in developmental processes such as 
those described above. It also includes genes encoding industrial or pharmaceutical 
biomaterials such as plastics and precursors thereof, perfumes, additives, enzymes and other 
proteins, and pharmaceuticals, wherein the plant effectively would be used as a bioreactor, 
eg., the two genes encoding production of potyhydroxybutyrate, a thermoplastic (Poirier et 

ai, Science 256: 520-523 (1992). 

As described more fully below in the examples, the full-length PR-1 promoter 
sequence was fused to the (J-glucuronidase (GUS) gene at the native ATG to obtain a 
chimeric gene cloned into plasmid pLTDlD. Plasmid pLTDID was then digested with 
restriction enzymes to release a fragment comprising the full-length PR-1 promoter sequence 
and the GUS gene, which was then used to construct the binary vector designated 
pCIB/LTDlD. This binary vector was transformed into Agrobacterium tumefaciens, which 
was in turn used to transform Arabidopsis plants. 

Plasmid pLTDID was also used to form a series of 5' end deletion mutants having 
increasingly shorter PR-1 promoter fragments fused to the GUS gene at the native ATG. 
Various restrict,™ enzymes were used to digest plasmid pLTDID to obtain the binary vectors 
with different lengths of promoter fragments. In particular, pLTDSD was constructed with a 
1,974-bp long promoter fragment; pLTD6D was constructed with a 1,293-bp long promoter 
fragment; pLTD61D was constructed with a 984-bp long promoter fragment; pLTD7D was 
constructed with a 815-bp long promoter fragment; pLTD71D was constructed with a 698-b P 
long promoter fragment; pLTD72D was constructed with a 621-bp long promoter fragment; 
o pLTDSD was constructed with a 572-bp long promoter fragment; and pLTD9D was 

constructed with a 78-bp long promoter fragment. Like the binary vector comprising the fuli- 
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length PR-, promoter fragment, tee 5 . cnd deletjo „ muiancs were also „a„ sformed 
^Sroiac,,™,,, ^/ac,^ a „ d . in Wn] ArabidopsU p|anB 

Each of the ,rans g e„ic Arabidopsis , inM was ^ fcy ^ ^ j ^ 
(INA). a Known .nducer of ,he PR- , promote, Gree „ ^ was 
• — and subjected t0 a GUS cnzyrae ^ „ ^ 

- . — of induction of each chimeric g _ For eac „ wnsge „ jc ^ T ^ d 

express™ c, mA was obIained by divMi „ g ^ speclflc acuviiy ^ ^ ^ ^ 

by the specfic activity of an ui.trea.oi control sample. 

As «^-"Po»^m,=„, w -, h r NA ,p 1 a numnsWdwi , hlhech . inCT . 

meludmg ,he fulHengtn (4 . 25s . bp , oog) pR ., pr „ CI _ d ^ 
■nducon of OUSexpre.ion compared to consols. ,„ addition, me ,.974-op, ,.293-bp 984 
bp. and 8l3 .bp ,„„g promo,er fragment, yielded simiiar .nduction of GUS expression by 
INA. The 698-bp ,o„g promoter fragment stil, yielded high inducibility by FNA, although 
, need by approximately 3. fold compared ,0 the longer promoter fragments. However, the 
6 -bp, 572-bp. and ,S-„p ,c„ g promoter fragments vieided substantia,* „„ induction of 
GUS expression „ y INA. These results are shown in Table , , which presents ,h. average 
values of GUS acdvity in p m0 ,e MU,m g protein/min (rNA-treatedA.ntrea.ed controfc, for me 
transgemc I.nes containing the PR- 1 promoter constructs. 

The presence of the correct hybrid const™, in the transgenic „„es was confirmed by 
PCR amp„f,cat,on. ,NA induction of the endogenous PR- 1 gene was confirmed by Northern 
lo, ana,ys,s for transgenic ,i„es co„„,„i„ g ,„e U93 -bp. 6 8 9-bp. 62,-bp. 574-bp and 78-bp 
long construes (5 lines per construct) indicating that the ,oss of inducioHity of GUS 
expression was due ,0 ,he gene construe, a™, „„, ,„ , a c k of SAR-medlated induction of gene 
expression in these particular lines or samples. 

Thus, a, .east a portion of the region of the Aratkbp* tha lu,nc PR-I promoter 
located between positions 3561 and 3638 of SEQ ID NO: , (between 698-bp and 62l-6p 
upstream from the transcription star, site) is required for chemical inducon of gene 
expression. Its removal complexly abolishes PR-I promoter induction. Moreover, additional 
elements ,ocated between positions -8,5 and -698 „so contribute ,„ f,„ i„ du cib„i,y of me 

leng.h PR- 1 promoter can therefore be „pera,ive,y linked ,„ codi og senses ,0 form smaHer 
cons^s than can he formed using the fu„-,e„g,h PR . , „. As „o K d earlier, sho^r 
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DNA fragments are often more amenable to manipulation than longer fragments. The 
chimeric gene constructs thus formed can then be transformed into hosts such as crop plants 
to enable at-will regulation of coding sequences in the hosts. 

While a deletion analysis characterizes regions in a promoter that are required overall 
for its regulation, linker-scanning mutagenesis allows for the identification of short defined 
motifs whose mutation alters the promoter activity. Accordingly, a set of 13 linker-scanning 
mutant promoters fused to the coding sequence of the GUS reporter gene (LS 1 to LS 1 3. 
Figure 1) was constructed. Each of them contained a 10-bp mutation (8-bp for LSI 2) located 
between positions -705 and -578 (nucleotides 3554 to 368 1 of SEQ ID NO: . ) and included 
in a 1 ,293-b P long promoter fragment (nucleotides 2966 to 4258 of SEQ ID NO: 1 ). Each 
construct was transformed into Arabidopsis and GUS activity was assayed for 19 to 30 
independent transgenic lines. The presence of the correct hybrid construct in transgenic Lnes 
was confirmed by PGR amplification of all lines containing LS7 and LS 1 0 constructs and by 
random sampling of lines containing the other constructs. Amplified fragments were digested 
, with Xbal and separated on high resolution agarose gels to distingu.sh between the Afferent 
LS constructs. The effect of each mutation on promoter activity was compared to an 
equivalent number of transgenic lines containing the unmutated l,293-b P construct. Two 
repetitions resulting from independent plating of seeds and INA-treatments were earned out 
in every case. 

Most LS mutations had no effect or minor effects on the promoter activity. However, 
3 of them had dramatic effects on the promoter function. One construct. LS4 (introducing a 
mutation at positions -666 to -675 (nucleotides 3584 to 3593 of SEQ ID NO: 1 ),. resu.ted in 
2-fold higher average induction of GUS activity by INA than the control construct and 
approximately 4-fo.d and 3-fold higher average GUS activity for INA- and water-treated 
„ samples, respectively (Tab.e 2). This suggested that a negative regulatory element had been 
mutated in this construct or. less likely, that a sequence responsible for higher promoter 
activity had been introduced. Since both induced and uninduced expression levels were 
influenced, the mutation seemed to affect a constitutive negative regulatory element. 
Interestingly LS5 which contained a mutation adjacent to the mutation in LS4, also led to 3- 
,o fold higher average levels of GUS activity after water treatment but IN A-induced levels of 
activity for LS5 were similar to the control construct. This suggested that such a regulatory 
element may span the sequences affected by both mutations. 
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o f SEQ :;;n;;:™ m r - ai ^ - 636 * - 645 <~ — « 

of SEQ ID NO " POSi "°" S ^ " - 6 ' 5 *« » 3653 

, m N0 ' ' ' C ° mp,We ° f in^uoibility of OUS activity by INA was ob s h 
Average GUS values for both water and INA „e,,, m , • observed. 

• were simi ,ar to watered values " *" 

U >0, average OUS V 2 S C °"' aini " g C ° mrol * - — « 

U5 activity. These results are consistent with the presence nf » 

reguiatorveiementthacisnecessaryforinductionofPR , , ^ 
the LS7 and LS10 locations. *** eXPrCSSi ° n by INA in or 

10 INA-induction of the endogenous PR 1 

^^^^ ™; ~ ilored ^ N ---«-y Sis 

significantly differ from lin™ C '" ,Stn,C,l and did «• 

SAR J d , T ^ ^ 10 gWC C ™ St ™ - « * ,ac t of „ r higher 

^AK-mediated induction of eenp pt^o.- , mgner 

gene express.on m these particular lines or samples 
The sequences mutated in the linl^r c™„ • 

H» sequence muj, in - ^ *" *»* <™J 

Mutation in a l t ' ^ *** ^ " 

-v,,v a,so contatned interest ^ ' ^ " " 

D N o C ° mPared '° ~ "GGGACTTTTCC" (SEO 

q mutated ,„ the rematning linker-scanning constructs 

~d,„de,eJL;i?: e r oT:t:r a,o8 r y '~ ntswere 

promoters identical Foreachr t ^ ' W ° C °" ,P< "" ,d!! °" ning 
— . inability 71 " nk — ^ «ve !ines showing medium .„ 

cM* by INA were treated i„ paraile, with water. 0.32*™ tNA. and 5mM SA 
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The responses of the different constructs to SA and INA were similar when normalized to the 
effect of each compound on the control construct. This observation supports the commonly 
held assumption that INA and S A act through the same pathway for induction of expresston 
of PR-genes Induction of LS4 by S A was approximately 2-fold higher than induct.on of the 
control construct by S A, as observed before with INA. For the five lines containing LS7, an 
average 1 5-fold induction by SA was measured compared to 1.4-fold by INA (in each case as 
an average of two independent repetitions). Interestingly, an average 4.0-fold induction by 
S A was measured for LS 10 compared to 1 .6-fo.d by INA. This suggests that the effect of the 
mutation in LS10 was less dramatic than in LS7 and that this difference could only be 
detected under stronger inductive conditions such as the treatment with SA. 

Organ-specific GUS expression was examined in roots and floral tissues of three 
independent untreated lines per linker-scanning construct. In the contro. construct, some 
weak GUS expression was detected in male organs but no expression could reproduably be 
detected in other flowers parts or roots. In the linker-scanning constructs a similar pattern of 
GUS expression was observed indicating that the mutations did not dramatically affect organ- 
specific expression of the PR-1 promoter. However, for some constructs, the intensity of 
GUS expression in the male floral organs differed from the control construct. In the three 
lines containing LS4 and LS5, expression was higher whereas in the three lines conta.mng 
LS7 and LS10. almost no GUS activity was detected. Therefore, the mutations appear to 
have similar up and down regulating effects on the promoter activity in uninduced male flora, 
organs as well as in uninduced green tissue. 

M-v.vo footprinting is based on mediation of guanine bases at position N7 by 
dimethyisulfate (DMS) followed by specific cleavage of the methylated guanines by 
pyridine. Changes in DNA occupancy by DNA-binding proteins alter the accessibthty of 
s DNA by the mediating agent, thereby yielding changes in populations of cleaved molecules 
after piperidine treatment. After LM-PCR a "G" ladder is resolved on a sequencing gel and 
differences in intensities of specific bands can be related to differences in DNA protection at 

the particular guanines. 

Analysis of the coding strand revealed inducible footprints at positions -629 and -628 
30 and at position -604 (FIG. 1 ) (nucleotides 3630, 363 1, and 3655 of SEQ ID NO: 1. 

respectively). At both sites, an increased band intensity after INA treatment was detected, 
indicating defection of these guanines upon INA treatment. These two footprints are 
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located in sequences mutated by linker-scanning constructs LS8 and LSI 1 (FIG. 1). Neither 
construct seemed to significantly influence the regulation of the promoter, but they are 
surrounding the sequence mutated in construct LS10, which had lost inducibility by INA. 
This suggests that that the region of the PR- 1 promoter around this site undergoes changes in 
its occupancy by DNA-binding proteins. No other change in band intensities could be 
detected in the range of DMS concentrations tested (0.04% to 1%), suggesting that the 
described sites are the only ones affected after INA treatment. 

Examination of the non-coding strand revealed an inducible footprint at position -641 
(361 8 of SEQ ID NO: I ). In this case, too, deprotection was observed. The guanine at 
position -64 1 is included in the sequence mutated in linker-scanning mutant LS7, which had 
lost inducibility by INA and that contains a recognition site for basic leucine zipper 
transcription factors at positions -645 to -640. Different DMS concentrations did not reveal 
any other inducible footprint. 

These results show that the examined region undergoes changes in protein-DNA 
interactions and suggest that the above described elements are required for the induction itself 
("switch" element) and are not just binding sites for constitutive transcriptional activators. 
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EXAMPLES 

The invention will be further described by reference to the following detailed examples. 
These examples are provided for purposes of illustration only, and are not intended to be 
limiting unless otherwise specified. Standard recombinant DNA and molecular cloning 
techniques used here are well known in .he art and are described by T. Mamat.s, E. F. Fntsch 
and J. Sambrock, M^.l.rn«nin«: A 1 nhoratorv manual, Cold Spring Harbor laboratory, 
Cold Spring Harbor, NY (1989) and by T.J. Silhavy, M.L. Berman, and L.W. Enquist, 
r .. r ,;^, c ..irh Oene Fusions. Cold Spring Harbor Laboratory. Cold Spring Harbor, NY 
(1984) and by Ausubel, P.M. et aL. Current Protocols in Molecular Biology , pub. by Greene 
Publishing Assoc. and Wiley-Interscience (1987). 

A. Full-Length Arabidopsis PR-1 Promoter 

Example 1 : Deternunation of the Transcription Start Site of the Arabidopsis PR- 1 gene 

RNA was purified from frozen tissue by phenol/chloroform extraction followed by 
lithium chloride precipitation (Lagrimini et al. (1987), Proc. Natl. Acad. Sci. USA 84: 7542- 
7546) A PR-1 specific right-to-left "bottomstrand" oligonucleotide correspond to 
positions +59 to + 32 downstream from the PR-1 ATG (ext.: AAG AGC ACC TAC AAG 
AGC TAC AAA GAC G) (SEQ ID NO: 2) was labelled at its 5' end (Sambrook J. et al. 
(,989) Cold Spring Harbor Laboratory Press. Cold Spring Harbor, New York) and served as 
primer for the extension reaction carried out with AMV reverse transcriptase accordmg to the 
manufacturer's recommendations (Promega, Madison, WI). The extension product was 
separated on a 6% polyacryamide gel and its length was determined by comparison wuh a 
sequence ladder obtained with pAtPRl-R (U.S. Patent No. 5,614,395) as a template and 
oligonucleotide extl as a primer using the Sequenase Version 2.0 DNA sequencing k,t 
according to the manufacturer's conditions (USB, Cleveland, OH). 
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Example 2: Fusion of the Arabidopsis Full-Length PR-1 Promoter to the 
Coding Sequence of the P-glucuronidase (GUS) Gene 

Plasmid pAtPR 1 -R was used as a template in PCR with a left-to-right "topstrand" 
primer extending from positions -268 to -25 1 upstream of the PR- 1 ATG (primer 1 a: GGC 
AAA OCT ACC GAT AC) (SEQ ID NO: 3) and a right-to-left "bottomstrand" primer 
comprising ] I bp of GUS coding sequence extending up to the GUS ATG and a further 9 bp 
of PR- 1 sequence extending from the ATG into the PR- 1 untranslated leader (primer 1 b- 
GGA CGT AAC ATT TTT CTA AG) (SEQ ID NO: 4). This PCR reaction was undertaken 
with AmpliTaq DNA polymerase according to the manufacturer's recommendations (Perkin 
Elmer/Roche, Branchburg, NJ) for five cycles at 94°C (30 s), 40°C (60 s), and 72°C (30 s) 
followed by 25 cycles at 94°C (30 s), 55°C (60s)and72°C (30 s). This generated a 
product of 278 bp through annealing of the homologous PR-1 sequences; the fragment 
included a Bglll site at its left end from the PR-1 promoter. A second PCR reaction was done 
using plasmid P BS-GUSI.2 (Uknes etai. The Plant Cell 5:159-169 (1993)) as a template and 
using a left-to-right "topstrand" oligonucleotide, which comprised 9 bp of PR-1 untranslated 
leader up to the PR- 1 ATG and a further 1 1 bp of GUS sequence from the ATG into the GUS 
coding sequence (primer Ic: CTT AGA AAA ATG TTA CGT CC) (SEQ ID NO: 5) and a 
right-to-left "bottom strand" oligonucleotide extending from positions -502 to -518 upstream 
of the PR-1 ATG into the GUS coding sequence (primer Id: TTA CGC TGC GAT GGA TC) 
(SEQ ID NO: 6). This PCR reaction was done under the same conditions as the one 
described above and generated a fragment of 527 bp through annealing of the homologous 
GUS sequences; this fragment included a SnaBl site at its right end derived from the 
amplified GUS sequence. 

The two PCR fragments generated above were gel purified using standard procedures 
to remove oligonucleotides and were then themselves mixed in a further PCR reaction 
("inside-outside PCR") with primers 1 a and 1 d. Conditions for this reaction were the same as 
described above. The amplified fragment was a fusion of the PR- 1 promoter fragment from 
the first PCR reaction described above and the GUS 5' coding sequence from the second PCR 
reaction described above and had a BgUl site at its left end and a .SnaBl site at its right end. 
The fragment was gel purified and cleaved with B g m and SnaBl (all restriction enzymes were 
purchased from Promega, Madison, WI) to yield a product of 497 bp in size that was ligated 
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in a three-way ligation with a Bglll-SacI fragment of pAtPRl-R containing PR-1 promoter 
sequences upstream from the BglH site and a SnaBI-SacI fragment of pBSGUSl.2 containing 
the 3' end of GUS to obtain pLTD 1 D. 

B. Deletion Mutant Analysis 

Example 3: Preparation of Chimeric Genes Containing Variable Lengths of the 
PR- 1 Promoter Sequence Fused to GUS 
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A. Construction of pLTD5D 

Plasmid pLTD 1 D was digested with restriction enzymes Xhol and Hpal; the 
protruding ends of the Xhol site were filled-in with Klenow DNA polymerase (Promega, 
Madison, WI); and the resulting blunt-ended vector fragment containing a 1,974-bp long 
promoter fragment was self-ligated. 

B. Construction of pLTD6D 

Plasmid pLTDID was digested with restriction enzymes Xhol and SnaBh the 
protruding ends of the Xhol site were filled-in with Klenow DNA polymerase: and the vector 
fragment containing a 1,293-bp long promoter fragment was self-ligated. 

C. Construction of pLTD61D 

Plasmid pLTDID was used as template for PCR with a left-to-right "topstrand" 
primer comprising a Xhol restriction site and extending from position -1 ,019 to - 1 ,000 
upstream of the PR- 1 ATG (primer N 1 ,076: ACC GCT CG A GAA TIT TTC TG A TTC 
GGA GGG) (SEQ ID NO: 7) and a right-to-left "bottomstrand" primer extending from 
position -584 to -607 upstream of the PR- 1 ATG (primer PR1R: TAT TTG TTT CTT AGT 
GTT TCA TGC) (SEQ ID NO: 8). The PCR reaction was undertaken for 3 cycles at 94°C 
(30 s), 50°C (30 s), and 72°C (30 s) followed by 30 cycles at 94°C (30 s), 55°C (30 s) and 
72°C (30 s). This generated a 412-bp long fragment containing a Xhol site at its right end and 
a Ndel site at its left end. The fragment was gel purified, digested with Xhol and Ndel, and 
ligated between the Xhol and Ndel sites of pLTD ID, resulting in a 984-bp long PR- 1 
promoter fragment fused to GUS. 
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D. Construction of pLTD7D 

Plasmid pLTDID was digested with restriction enzymes Xhol and Bstm, the 
protruding ends of both Xhol and BstEU sites were f,Hed-in with Klenow DNA polymerase; 
and the vector fragment containing a 8 1 5-b P long promoter fragment was self-ligated. 

E. Construction of pLTD7 ID 

Plasmid pLTDID was used as template for PCR with a left-to-right "topstrand" 
primer comprising a Xhol restriction site and extending from position -733 to -714 upstream 
of the PR- 1 ATG (primer N781 : ACC OCT CGA GAT AAA TCT CAA TGG GTG ATC) 
fSEQ ID NO: 9) and a right-to-left "bottomstrand" primer extending from position -584 to - 
607 upstream of the PR-1 ATG (primer PR1R: TAT TTG TTT CTT ACT GTT TCA TGC) 
(SEQ ID NO: 8). The PCR reaction was undertaken for 3 cycles at 94°C (30 s), 50°C (30 s) 
and 72°C (30 s) followed by 30 cycles at 94°C (30 s), 55«C (30 s) and 72«C (30 s). This 
generated a 1 26-bp long fragment containing a Xhol site at its right end and a Ndel site at its 
left end. The fragment was gel purified, digested with Xhol and Ndel, and ligated between the 
Xhol and Ndel sites of pLTD ! D, resulting in a 698-b P long PR- 1 promoter fragment fused to 
GUS. 

F. Construction of pLTD72D 

Plasmid pLTDID was used as template for PCR with a left-to-right "topstrand" 
primer comprising a Xhol restriction site and extending from position -656 to -637 upstream 
of the PR- 1 ATG (primer N704: ACC GCT CGA GTT CTT CAG GAC TTT TCA GCC) 
(SEQ ID NO: 10) and a right-to-left "bottomstrand" primer extending from position -584 to - 
607 upstream of the PR- 1 ATG (primer PR 1 R: TAT TTG TTT CTT AGT GTT TCA TGC) 
(SEQ ID NO: 8). The PCR reaction was undertaken for 3 cycles at 94°C (30 s), 50°C (30 s), 
and 72°C (30 s) followed by 30 cycles at 94'C (30 s), 55°C (30 s) and 72°C (30 s). This 
generated a 49 bp long fragment containing a Xhol site at its right end and a Ndel site at its 
left end. The fragment was gel purified, digested with Xhol and Ndel. and ligated between the 
Xhol and Ndel sites of pLTDID, resulting in a 621-b P long PR-1 promoter fragment fused to 
GUS. 

G. Construction of pLTD8D 

Plasmid pLTDID was digested with restriction enzymes Xhol and Ndel; the 
protruding ends of both Xhol and Ndel sites were filled-in with Klenow DNA Polymerase; 
and the vector fragment containing a 572-b P long promoter fragment was self-ligated. 
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H. Construction of pLTD9D 

P lasm id pLTD.D was digested with <-*— -^-^ - 

h of both M and Sg m sues were r„Ud-in with Klenow DNA Polymer. 

Example 4: Plant Transformation 



A Construction of Binary Vectors 

Plasn*, pLTDlD was digested w«h restriction enzymes W and *d re,eas,ng a 

7 „t ,ha. was gel P-fied and inserted between the Sail and Sad sues of 

6 422-bp long fragment that was gei puru 

„C B2 ™ (O S Pa.en. No. 5.6,4,395). resu,.ing ,n pCIB/LT D1D . P.as m ,ds pLTD5D, 
LTD6D pLTD61D, pLTD7D, pLTD7 ,D. pLT72D. pLTD8D, and pLTD9D were dtgesteo 
pLTD6D. pLTDol " siting PR-, promoter-GUS gene fus,ons 

<4.,38-bp, 3.457 bp • ^ ^ ^ ^ ^ Sacl „ of 

^•; 1D , P~D. PCTB^B, and P C^TO,0. respect, 

The binary vector constructs w (1993)) 
, ru Acad Sci Paris, Sciences de la vie, 316.1 194 uv»^ 

GV31 „, <B-- ; ;' w « ^ Rep , :j (l987) , Arabldopsis „as t— 
by e,ec,r„po,at,on (Dowe, CR Acad . Sci . Pari s. Sciences de ia 

using the vacuum infiltrat.on method (Bechtold, „ . „ m seeds) ^ plated on 

vic 3 1 6., 1 94- 1 .90 (1 993,).Seedsob,..nedfro ro ,nmua,eopiants(T, seeds) 

2* orcein -ate and resistantT, plants were transfer.* to so,,. 
5 — intomai.usingthemethoddescribed f 

„JTJ: Biotechnology ,.: ,94.00, ,993, using pa.c,e bo m ba.n«n, into cel.s „, 

immature embryos. 

*r^T— - — into — wh.aten.b.osand 
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Biotechnology 11: 1553-1558 (1993), and Weeks etal., Plant Physiology 102: 1077-1084 
(1993). 

Example 5: Determination of the Inducibility of Gene Expression by Chemical Regulators 

A. Treatment with INA 

For each transgenic line, seeds obtained from Tl plants were harvested (T2 seeds) and 
plated on duplicate plates containing 50 mg/I of kanamycin sulfate. After twenty days, one 
plate was treated by spraying with 0.25 mg/ml INA while the duplicate was kept as a control. 
Three days later, green tissue was harvested, flash frozen, and kept at -70°C . 

B. GUS Enzyme Assay 

Frozen tissue was homogenised to a fine powder under liquid nitrogen. Extracts were 
prepared in GUS assay buffer (50 mM sodium phosphate pH 7.0, 0.1% Triton-X 100, 0.1% 
sarkosyl, 10 mM beta-mercaptoethanol) as described by Jefferson, R.A. et a/., Proc. Natl. 

is Acad. Sci. USA 83, 8447-8451 (1986). The reactions were carried out in the wells of 

microliter plates by mixing 10 jxl of extract with 65 u.1 of GUS assay buffer containing 4- 
methyl umbelliferyl glucuronide (MUG) at a final concentration of 2 mM in a total volume of 
75 nl. The plates were incubated at 37°C for 30 minutes and the reaction was stopped by the 
addition of 225 [i\ of 0.2 M sodium carbonate. The concentration of fluorescent indicator 

:o released was determined by reading the plates on a Flow Labs Fluoroskan II ELISA plate 
reader. Duplicate fluorescence values for each samples were averaged, and background 
fluorescence (reaction without MUG) was substracted to obtain the concentration of MU for 
each sample. The amount of protein in each extract was determined in a Bradford assay (Bio- 
Rad laboratories, Hercules, CA) according to the manufacturer's instructions. The specific 

5 activity was determined for each sample and was expressed in pmoles MU/mg 
protein/minute. 

Table 1 shows the average values of GUS activity (INA-treated/untreated controls) for 
the transgenic lines containing the PR-1 promoter constructs. Here, GUS values are 
expressed in pmole MU/mg protein/min, and the number of independent transgenic lines used 
for the determination of each value are shown in column (N). For each independent 
transgenic line, the induction of GUS expression by INA was obtained by dividing the 
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specific activtt, of .he INA-treated sample by the specific activity of .he untreated control 

sample. 

Table 1 

Promoter BJ^Btti gl^o* H 

!f 3 0 «8 

1.974-bp 13 ^ ]0 21 

1,293-bp 106 955 ^ u 

984-bp 36 , n#ui 19 20 

5 19 

698-bp 78 * Q9 19 

621-bp 57 -J 19 

572-b P 165 « -1 

,43 137 



78-bp 



10 



A, shown in Tabie ..upon batmen, with INA. plants transformed with the chimeric 
g e„e including .he full-length <4, 5 S-bp long, PR, Poorer <"«— 
Luctton of GUS expression compared t o controls. In addition, me ,.97** ..293-bp 9 8 4- 
bp and 8,5-bp ,on g promoter fragments yietded simiiar induction of OUS exprcss.on by 
1. The 6,8 bp ,ong proper fragment sti., yielded high inducible by INA, although 
reduced b, approximate,, 3-4 foid compared ,0 the .onger promoter fragr^Howeve . the 
6,,-bp. 572-bp. and 7S-bp ,on g promts fragments yieided substantially no tnductton of 
GUS expression by INA. 

C Treatment with BTH 

mstead of INA as described above in Example 5A. plan, material is sprayed w th BTH 
as described by Goerlach « cL. T„e P^< Celt 8: 629-643, ( 1996k Friedrich at., M 
Jo^al (.996V. and Lawton „ 1. Ptan, «.99«. BTH treated p,»n. tissue ,s then 
subjected to a GUS assay as described above in Example 5B. 

D Treatment with salicylic acid (SA) 

■nstead of INA as described above in Example 5A. p.an, materia. ,s sprayed w,th 
5mM SA, sodium sal, SA treat* plan, tissue is then subject* to a GUS assay as describe* 
above in Example 5B. 
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C. Linker-Scanning Mutant Analysis 

Example 6: Preparation of Chimeric Genes Containing Linker-Scanning Mutants of the 
Arabidopsis PR- 1 Promoter Sequence Fused to the P-glucuronidase (GUS) Reporter Gene 

A. Contraction of pLS 1 to pLS9 

Plasmid pLTD6D was used as a template in PCR with a left-to-right "topstrand" primer 
extending from positions -887 to -867 (primer And: AGG TAT ACT GGA GAT AGG 
AGG) upstream of the PR- 1 ATG and a right-to-left "bottomstrand" primer comprising 26-b P 
of PR- 1 promoter sequence (positions -715 to -741 upstream of the PR- 1 ATG) and a further 
10-bp containing a Xbal restriction site (primer LS 1-: GCT CTA GAG GGA AAA AAA 
AAA AAA AAA AAA AAA AAA (SEQ ID NO: 1 1)). This PCR reaction was undertaken 
with AmpliTaq DNA polymerase according to the manufacturer's recommendations (Perkin 
Elmer/Roche, Branchburg, NJ) for three cycles at 94°C (30 s), 50°C (30 s), and 72°C (30 s). 
followed by 30 cycles at 94°C (30 s), 55°C (30 s) and 72°C (30 s). This generated a product 
of 1 84-bp (fragment A 1 ) through annealing of the homologous PR- 1 promoter sequences; the 
fragment included a BstEII site from the PR-1 promoter at its left end and a Xbal site at its 
right end. A second PCR reaction was done using the same template with a left-to-right 
"topstrand" primer comprising 22-bp of PR- 1 promoter sequence (positions -730 to -708 
upstream of the PR-1 ATG) and a further 10-bp containing a Xbal restriction site (primer 
LS 1+: GCT CTA GAG CAA TCT CAA TGG GTG ATC TAT TG (SEQ ID NO: 12)) and a 
right-to-left "bottom strand" primer extending from positions -584 to -607 upstream of the 
PR-1 ATG (primer PR1R: TAT TTG TTT CTT AGT GTT TCA TGC (SEQ ID NO:8)). This 
PCR reaction was done under the same conditions as the one described above and generated a 
fragment of 1 94-bp (fragment B I ) through annealing of the homologous PR- 1 promoter 
sequences; this fragment included a Xbal site at its left end and a Ndel site from the PR-1 
promoter at its right end. 

Fragments A I and Bl generated above were gel purified using standard procedures to 
remove oligonucleotides. Fragment A 1 was cleaved with BstEII and Xbal (all restriction 
enzymes were purchased from Promega, Madison, WI) and fragment B 1 was cleaved with 
Xbal and Ndel. Both fragments were ligated into plasmid pLTD6D that had previously been 
digested with restriction enzymes BstEII and Ndel, resulting in plasmid pLS 1. 
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' Piasmids pLS2 to PLS, we. c~d using tne same strategy. Fore., — a 
„ A (A2 to A9> «. ampU«.d using ^ 

..^omstraod- prime, (LS2- «. LS»-. respective*. - » (B . " ™ ~ 
ampUfied using a ieft-to-rigr, -.opstrand- pnmer <LS2 + to LS9 + , respect-ve y, and 
Z~~* primer «* Iragments were ge, purifed - -*«^ 

fragments B). Corresponding pairs were ..gated .nto P L UbU 
in plasmids pLS2 to pLS9. 

pnnKr Ls2 -: GTC CTA GAG CTA TCC AAA AAG AAA AAA AAA AAA A (SEQ ID 

Ntt '?«J ■ GTC CTA GAT ACA TTG AGA TTT ATC CAA AAA G (SEQ .D NO: .4, 
' GTC CTA G A A TAT AGATCACCC ATT GAG ATTT (SEQ ID NO:l5) 

GTC CTA GAT TG A AAC AGT CAA TAG ATC ACC (SEQ ID NO: 16) 
^ ^ GTC CTA GAG GGT OAC GTA GAG AAA CAC TCA A ,SEQ ID NO,7> 
' , ^ GTC CTA GAA AAA GTA AAA TAG TGA CGT AGA G (SEQ ID HO: ,S) 
primer LS8-: GTC CTA GAT TTC TAT G AC GTA AGT AAA ATA GTG (SEQ ID 

Na,9) LS o ■ GTC CTA GAC GTG CCG CCA CAT CTA TGA CGT A (SEQ ID NO:20> 
Pnmer , J : G^C CTA GAA TGG TGA TCT ATT GAC TGT TTC TC (SEQ .D NO*., 
" ^ GTC CTA GAG ATG ACT GTT TCT CTA CGT CAC (SEQ ID NO:22, 

Pn ™ r S t- Zc CTA GAA TTC TAC GTC ACT ATT TTA CTT AC (SEQ ID NO:23, 
primer LS5+: GTC CTA GAT ATA TTT TAC TTA CGT CAT AGA TGT G (SEQ ID 
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LS6+ - GTCCTAGAGAACGTCATAGATGTGGCGGCA<SEQIDNO:2 5) 

GTC CTA GAT GTG TGG CGO CAT ATA TTC TTC AG (SEQ ID NO:26) 
primer GTC CTA GAT TTA TAT TCT TCA GG A CTT TTC AGC (SEQ ID NO:27) 

primer LS9+: GTC CTA GAA TAG GAC TTT TCA GCC ATA GGC (SEQ ID NO:28) 

B. ConstructionofpLSIOtopLS12 

F „ eacn construe, a fragment A <A,0 to A,2) was produced . 
(ert-to-righftopsuand- primer Anc, and a rignt-to-ieft -be*—* ^-(LSiO 
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i» ». PR-, promoler were nsed /"f 8 PR ;' fences „ d teired 

resulting in plasmids pLSlO to pLS12. bef ° re < 
Primer LSI0-: GTC CTA GAA CGA AGA ATA TAT GCC GCP A r ,« 

ID NO:32) "* ° TC ATA GA ° ATA CTC A (SEQ 

primer LS 1 03: TAT GAG TAT nr t a t ^ * ^ 

N Q 33> A G TAT CTC TAT CAC TCT TGC CTA TGG CTC CT (SEQ ID 



primer LSI] 2 
primer LSI 13 
primer LS 122 
primer LSI 23 



Zr CAC T TAC TCA <** "> NO:34, 

TAT GAG TAT CTC TAT CAC TCT TCA T (SEQ ID NO:35) 

CTA GAT AGA GAT ACT CA (SEQ ID NO:36) 

TAT GAG TAT CTC TAT (SEQ ID NO:37) 



C. Construction of pLS13 



A PCR fragment was produced as de><irr\u»A u 
« * purtBed . dlfested wl ,„ BstEn ^ ^ ^ 

resulting in pLS 1 3. Pi-TD6D as desenbed before. 
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Example 7: Transforation of Arabidopsis 



A. Construction of P CIB200 (Schrn idhauser and Helinski, J. 

TJS75 Kan was ftrst created * »~» £ gMe , follow ed by 

Bacterio,. ,«= «M« m *":„ g ,, an<) vierra. ... Gene ,,. **» 

0982 » carrying a W^- PC1B200 b0Biers , . pl a„t seiectabie 

B.Construclton of binary vectors -ih restriction enzymes Kpnl and SacT. The 

r^en.scontaining.hePR-1 p,„ m o,e,-GUS .0 wpCB MI3. 
J Kp n. and Sac, si.es of pOKOO. resu„,n 8 .n piasmrds p 
IS respectively- 

C. vacuum inffltxauon of Arabtdopst ^ 
T „e binary vector constructs descrtbed tu tn ^ ^ 

Agr obacteriutn urmefacieus strain OV3,0, (Berchtold^N^et ^ ^ ^ 

„ V3. V *m. Arabidopsiswastrar.formedus ^ sce dswere 

r^'— — ' 

D. Transformation of Matze ^ MnrfoBreli im o nuize 

j k„ Krvnel ef ai-i Biotechnology i ' 
„ using the method descnbed by Koz.el *r , 
" b ombardment into ceils of immature embryos. 

E. Transformation of Wheat transfo rmed into immature 

Thebinaryvectorcons— ^ 
whe at embryos and immature embryo-denvedcal 

M by VasiU««/-. Biotechnology U: 1553 
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Example S.Dete^nationofthelnducibility of GUSe,n" • , 

«y of GUS expresaon by Chemical Regulators 

A. Treatment with IN A 

duplicate was kept « comro| " m8/ "" mA 0*>"'«>t,»ic acid) while fc 

6 ^ laBr ' -** «■» — d. deep ta e„ ^ 

B. Beta-Glucuronidase (GUS) Enzyme Assay 

Fro2enlMf «»ue was ground ina . 
*og t „ ,o produce , fine powder ™* 3 f -" " »» >«— 
- sodiun, ptesptae pH7 0 . „ , % - * « ^ i- 00, extraction buffer (50 

-~*«- —0 » deseed by fcffers0 * 1 * ^ '° mM 
The .actions a„ carried ou, in 'J^Z ' ^ * M4MW ( " 8 <» 

■ 65 pJ of GUS assay buffer °"~«*- * -■«■* -Op., of ex M „ wiln 

carbonate. The concent™ inn „f n ^ M sodlum 

o, a Fl „ w u ::;T;;:r '; ndica,or reteed ~ d ~ * «*. - 

«* s« mp ,es were averaged Jd b T ^ ^ 

each extract „ as dBlenninedllsi h . sam P k - The "mount of protein in 

defined for each san.pie a „d was " reM «°°°n, The specific Bi „» was 
Table 2 shows J '" ^ MU '™* P™tein/minute. 

*« - expressed - p Je M^" 8 ~" ~— «• ODS 

^pendent transgenic „„, „. induction of GITS e '" " ^ 
dividing the specific acUvity of t he J A , T * 
»— contro, sa.oie ^ ^ * of the 



untreated control sample 
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Table 2 

Construct Control M . 

LSI ^ 250 Fold Inri.,rH»n 



LS2 69 £ u 5.9 20 

LS3 76 «8 9. 8 2Q 

LS4 3Ifi y 76 10.4 2n 

roc JI6 4,073 ™ « 

LS5 302 | , n , 20 8 20 

LS6 I94 ' 2 2 3 02 6.0 20 

LS7 ,04 ?S 45 20 
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N 



LS8 58 i!! » 4 20 

LS9 169 '2.8 J9 

LS10 2 07 „i 4.7 20 



LSn 270 6^ L7 3 0 

LSI2 96 603 4.8 , 9 

LSI3 I28 611 , 18 2Q 

wt 107 997 53 20 



11.6 



21 



AS shown in Tab,, 1, mosl ls mutalioris „ ad „ o 

co„i, c„„Z'ri average induc "° n ° f ° us * mA - - 

01 construct and approumattvely 4-fold and 3-fold hijher av«r»„ r-i ,c • 
and water-treated samp,es. respective*. This suggested tha!^ * ^ s «.v„ y forINA . 

h ^ k ' '"'s suggested that a negative regulatory elem-nr 

» . a compiete ,oss of inducinihty of GUS activity by INA was observed ' ^ 

v^ues for both wa*r and INA .reagents in „ ansgeI , ic „„„ contoli „ g ^ZZ^ , 

ou i:;;: s, * , 7 ,8ter ^° f -' i ---ah i gh U „,„d UC eda„d „ d r; 

ideations. U «'°"° f "K->I-=e*p re ss,on INA ,„ or near the LS7 and LS,0 



C. Treatment with BTH 



Instead of IN A as described above in Examole 8A ni,n t > - , • 
as deseed by Goeriach „ „. „, « ^ ~' «* ™. 

04 J • < 1 "o): Fnednch <>r a/. , p/^, 
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subjected a GUS assay as described above in Example 8B. 
n Treatment with salicylic acid (SA) 

° ^ of rNA - described above in U p.an, macerial * «* 

above in Example 8B. 

D. /n-vivo Footprinting Analysis 
Example 9: ln-vivo Footprinting 

four week, they were sprayed wnh 0.65rnM ^ ^ ^ 

— - " m ^ ~ 1 11 » ~* water. Has* in 
Shames, for 2 nun « n- » ' . ^ CTAB rrKthod. 

liq „id nitrogen and lyophutsed for 2 days. DNA 1 % CTAB, 0.7M NaCI. 

„ 10m MEDTA..%b. mCT capto eI h»no.at60Cfor45 _ 

„ _ temperature. T* P.cpua,. ^ wdiura _ and , 5 

-«"» 3TCDNA *" :~:r™l- 1 eaved wi ,,Mp i p« i d in e f o t 30 
volumes ethanol and resupendedtnTE buffer. 

„ m i„ a, *,C, lyopnilised 3 times and suspended in dionrsed -~*» - 
" genomte DNA was treated with 0.5% DMS for 30 sec a, room temperature 

pyridine as deseed a^ve. ^ ^ p R ^ ^ „. 

Ligation-medtatedPCRCLMPC a l plant Molecular Biology Manual B 1 8: 1 -34 

Science 246:780-786 < 1 989) and Br.gnon. P. e« al. Plan Mo, 

heated at 95°C for 5 min, annealed at 50 C for 
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out at 42°C for 5 min with Sequenase Version 1 .0 (USB Biochemical*). A unidirectional 
staggered linker formed by LMPCR2 (GAA TTC AGA TC (SEQ ID NO: 40)) and LMPCR3 
(TGA CCC GGG AGA TCT GAA TTC (SEQ ID NO: 41)) was ligated to the blunt-ended 
DNA molecules for 15 hours at I7°C. Exponential PCR was carried out with primers 
LMPCR3 and P2- (ACT TTA TAT CTA CAG TCA ATT TTC AAA, -502 to -529 (SEQ ID 
NO: 42)) using KCI-based Taq poymerase buffer supplemented with 2.5mM MgC12 under 
following conditions: 16 cycles at 94°C/1 min, 55°C/2 min, 74°C/3 min and an addition of 5 
sec to the extension step for every cycle, followed by one cycle at 94°C/lmin, 55°C/2 min. 
74°C/10 min. The end-labeling PCR was carried out using primer P3- (GTT TAT ATC TAC 
AGT CAA TTT TCA AAT AAA AG. -503 to -535 (SEQ ID NO: 43)) in 5 cycles at 
94°C/i min, 60°C/2 min. 76°C/3 min. Non-coding strand analysis was carried out similarly 
using primer P41+ (CTT TTT TTT TTT TTT TTT TTT TTT TTT TC. -735 to -706 (SEQ ID 
NO: 44)) for first strand synthesis annealed to plant DNA at 50°C. A unidirectional staggered 
linker formed by LMPCR2 (GAA TTC AGA TC (SEQ ID NO: 40)) and LMPCR1 (AGT 
TAC TAG TGA GAT CTG AAT TC (SEQ ID NO: 45)) was ligated to the blunt-ended DNA 
molecules and exponential PCR was carried out with primers LMPCRI and P52+ (TTT TTT 
TTT TTT TTT CTT TTT GGA TAA ATC. -722 to -692 (SEQ ID NO: 46)) using an 
annealing temperature of 55"C. End-labelling PCR was done with primer P53+ (TTT TTT 
TTT TTT TTT CTT TTT GGA TAA ATC TC, -722 to -690 (SEQ ID NO: 47)) using an 
annealing tempetrature of 60°C. Amplified fragments were separated on a 0.4mm thick, 6% 
polyacrylamide gel and dried for 30 min at 80°C. BioMax MR films were used without 
intensifying screen. 

Analysts of the coding strand revealed inducible footprints at positions -629 and -628 
and at position -604 (FIG. 1 ) (nucleotides 3630, 363 1 , and 3655 of SEQ ID NO: 1 , 
respectively). At both sites, an increased band intensity after INA treatment was detected, 
indicating deprotection of these guanines upon INA treatment. These two footprints are 
located in sequences mutated by linker-scanning constructs LS8 and LSI 1 (FIG. I). Neither 
construct seemed to significantly influence the regulation of the promoter, but they are 
surrounding the sequence mutated in construct LS10, which had lost inducibility by INA. 
This suggests that that the region of the PR-1 promoter around this site undergoes changes in 
its occupancy by DNA-binding proteins. No other change in band intensities could be 
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detected in the range of DMS concentrations tested (0.04% to 1 %). suggesting that the 
described sites are the only ones affected after IN A treatment. 

Examination of the non-coding strand revealed an inducible footprint at position -641 
(361 8 of SEQ ID NO: 1). In this case, too, deprotection was observed. The guanine at 
position -641 is included in the sequence mutated in linker-scanning mutant LSI, which had 
lost inducibility by INA and that contains a recognition site for basic leucine zipper 
transcription factors at posit.ons -645 to-640. Different DMS concentrations did not reveal 
any other inducible footprint. 

These results show that the examined region undergoes changes in protein-DNA 
interactions and suggest that the above described Cements are required for the induction itself 
("switch" element) and are not just binding sites for constitutive transcriptional activators. 

E. Methods for Isolating Transcriptional Regulatory Proteins 

Example 10: Screening of Expression Libraries (South-Western) 

An oligomer containing defined parts of the PR- 1 promoter, such as the sequence of 
LS4 LSI, or LS10, is used to screen a cDNA expression library (Singh, H. et al. 
Biotechniques 7:252-261 (1989). A cDNA expression library is plated and the proteins are 
transferred onto a nitrocellulose filter. The filter is probed with a radiolabeled oligomer 
contaimng one or more copies of the sequence of interest. Clones expressing protein, that 
bind to this sequence are detected by autoradiography and isolated. 

Example 1 1 : Yeast One-Hybrid System 

An oligomer containing defined parts of the PR-1 promoter, such as the sequence of 
LS4, LS7, or LS 10, is used as a bait in a yeast one-hybrid system (Li, J.J. and Herskow.tz. I. 
Science 262- 1870-1874 (1993)). The chosen sequence is fused upstream of a minimal 
promoter and a reporter gene and transformed into yeast. The resulting yeast strain is 
n transformed with a cDNA expression library fused to a yeast activation domam. Upon 
specific interaction between the bait and a fusion protein, transcription of the reporter gene 
activated. The corresponding clone is isolated. 
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F. Methods for Construction of Inducible Hybrid Promoters 
Example 12: Inducible Hybrid Promoters 

The above described elements (one repeat or preferably several repeats) are fused 
m.mm.1 promoter in order to obtain inducible gene expression. For example, the re gI on of 
the PR- 1 promoter spanning LS7 through LS 10 (nucleotides 3614-3653 of SEQ ID NO- 1) 
may be used to confer inducibi.ity to a promoter fragment. Transcriptional enhancer elements 
are als0 included into the synthetic promoter in order to obtain increased gene expression. 

Example 1 3: Inducible Tissue- or Organ-Specif.c Promoters 

The elements described above in Example 12 (one repeat or preferably several repeats) 
are fused to or included into promoters that confer tissue- or organ-specific gene expression 
in order to obtain inducible gene expression in a particular tissue or organ. 
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SEQUENCE LISTING- 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Lebel, Edouard 
Ryals, John 
Thome, Leigh 
Uknes, Scott 
Ward, Eric 

(ii , TITLE OF INVENTION : Chemically Inducible Arabidopsis PR-1 Promoter 
(iii) NUMBER OF SEQUENCES: 51 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Novartis Patent Department 

!b) STREET: 540 White Plains Road, P.O. Box 2005 

(C) CITY: Tarrytown 

(D) STATE: New York 

(E) COUNTRY: USA 

{ p) ZIP: 10591-9005 

(v) COMPUTER READABLE FORM : 

(A) MEDIUM TYPE: Floppy disk 
tn) COMPUTER: IBM PC compatible 
C OPERATING SYSTEM: PC- DOS /MS-DOS 

IS SOFTWARE: Patentln Release #1.0, Version #1.30 

(viii) ATTORNEY / AGENT INFORMATION: 
t A) NAME: Meigs , J. Timothy 
m\ RFGXSTRATION NUMBER: 3 8,241 
( (C) rSeSnCE/DOCKET NUMBER : CGC 1873/PCT 



fix) TELECOMMUNICATION INFORMATION: 
1 (A) TELEPHONE: (919) 541-8587 

<B> TELEFAX: (919) 541-8689 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: promoter 



\l\ ^I^ORMATION: /note- Tull-lenath Arabidopsis 
PR-1 Promoter Sequence" 



(ix) FEATURE: . 

(A) NAME/ KEY: TATA_s ignal 

(B) LOCATION: 4229.-4232 
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(ix) FEATURE: 

U) NAME/KEY: mi SC _signal 
B LOCATION: 4294. 
(D) OTHER INFORMATION- 
translation" ««axion. /note= "Start codon for 

(ix) FEATURE: 

IS! KK£: sn-'sjr 

construction" 9raent in Piasmid pLTD6D - used for rc 

ea tor Ls construct 

( ix) FEATURE; 

(A) NAME / KEY : misc.feature 

(B) LOCATION ; 3444.. 4258 
(D) OTHER INFORMATTnM. / 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
B) LOCATION: 3561.. 4258 



thaliana 



(D) OTHER INFORMATION : /„ ote = "698 h„ i 
™-l Procter fragment in iaslTptS^^" 
(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 3638? 4258 

(D) OTHER INFORMATION- ., 
tHa lian a PR - 1 promoter f»Si t / ^r.^i 2 i-STJ^. Ar * bi **- i - 
(ix) FEATURE: 

(A) NAME/KEY: niisc feature 
B) LOCATION: 3 5547.3^63 

OTHER INFORMATION: / note = " Lsl . 
( ix ) FEATURE : 

(A) NAME /KEY: misc_featur e 

(B) LOCATION: 3564 3573 

(D) OTHER INFORMATION: /noCe= »LS2» 
(ix) FEATURE: 

(A) NAME /KEY: misc feature 

(B) LOCATION: 3574 3583 

(D) OTHER INFORMATION: /note= »LS3 » 
(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 3 584 3 593 

(D) OTHER INFORMATION: /note= »LS4" 
( ix ) FEATURE : 

(A) NAME / KEY : misc_f eature 

(B) LOCATION: 3594.. 3603 

(D) OTHER INFORMATION: /note= «LS5» 
(ix) FEATURE: 

(A) NAME /KEY: misc.feature 

(B) LOCATION: 3604 3613 

(D) OTHER INFORMATION: /note= " LS 6 " 
( ix ) FEATURE : 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION: 3614.. 3623 
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(D) OTHER INFORMATION: /note= " LS7 " 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION: 3624.-3633 

(D) OTHER INFORMATION: /note= "LS8" 

(ix) FEATURE: 

(A) NAME / KEY : misc_ feature 

(B) LOCATION: 3634.. 3643 

(D) OTHER INFORMATION: /note= n LS9 M 

(ix) FEATURE: 

(A) NAME / KEY : mi sc_ feature 

(B) LOCATION: 3644.. 3653 

(D) OTHER INFORMATION: /note= "LSIO" 

(ix) FEATURE: 

(A) NAME/ KEY : misc_feature 

(B) LOCATION: 3654.. 3663 

(D) OTHER INFORMATION: /note= "LSll" 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 

(B) LOCATION: 3664.. 3671 

(D) OTHER INFORMATION: /note= "LS12" 

(ix) FEATURE: 

(A) NAME / KEY : rnisc_f eature 

(B) LOCATION: 3672.. 3681 

(D) OTHER INFORMATION: /note= n LSl3" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CTCGAGTTAT 


TTTCAAAAAG 


CAGTATCGGC 


TAGAGCATCA 


AGAAACTCGA 


TTAAAAGTTT 


60 


ATCAAATGAA 


GCTGGAAAAG 


TCTTGACAGA 


GTTAGAAGAC 


ATTAAAATAG 


AGGTTGTGGA 


120 


GTATTGTAAA 


AGAATTATGC 


AAGCGCAACC 


TACATGACTA 


GTGGAGATAT 


CATATGACTA 


180 


CTTATCAGGG 


TTGGTGAATT 


TTAAATATTC 


CCAGACTGCT 


GCTAAAACCT 


TAATGCATCC 


240 


AATTGGTGTT 


GACGAGATGG 


CTATCCTGCT 


CAGTTTTTAA 


TTGCTGCTTG 


GTCGATCTTG 


300 


GAAATGATTT 


TATAATAGCA 


GTGCAGTCCT 


TCTTGATCTT 


TGGATGCATG 


CCCAAAAGTG 


360 


TTAAATCAAC 


TATTCTGACT 


CTTGTTCCCA 


AAACAGATGG 


TGCACAAAAT 


ATGAAGGAGT 


420 


TCATACTGAT 


AGCGTGATGC 


AATCTTCTAT 


ATTAAGTGAT 


ATATAAGATA 


ATAGCAAACC 


480 


GGCTTAAAGT 


TACTTTACAA 


GAGGCGATGG 


AACCGAATCA 


GAGCACCTTT 


GTGAAGGGGA 


540 


GGCTCTTACT 


AG AG AAC ATA 


TTTTTAGCAA 


CAAAACTAGT 


CAAGGACTAC 


CACAAGCAAT 


600 


CACTCTCATC 


TCGTTTAGCA 


ATTAAGCTTG 


ATATCTCTAA 


AGCGTTTGAC 


AT AG AG C AAT 


660 


GGCCGTTTAT 


TGCTGCTAGG 


CT AC GTGTG A 


TGGGTTATCC 


AT AG CTCTTT 


AT AC AC T AG A 


720 


TAAATATATG 


CATCTCTACG 


TCCTCGTTTT 


GTTTTTTTCT 


CTAGCTCTTG 


TGGTATAAGG 


780 


AAAGGATGCT 


CTCTTTCACC 


GTACTTCTAT 


GTTATCATCA 


ACAATGTTTT 


GTCGACTATG 


840 


TTAAACAGAG 


CAGCTGTTAT 


GAAAGAGATT 


GGTTCTCACC 


CGTTTTGCAA 


GGAGATAAAG 


900 
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CTTACACATC 
CTCTGCAACA 
GTGTCCAAGT 
ACATCAGTAG 
CAAAAGCAAT 
TAATTGGACA 
TGAAGCATCT 
TGAAAGGAGG 
AGTGTCTTGG 
ATGGAGATGT 
TCTCTATGGG 
AAAGCAACGT 
ATTGAGTTTA 
TGGTTGTCAA 
AGTGCACCGA 
TGGTCAGAAC 
ACACTCGGCT 
TTTAAAGTAA 
ACATATTG T A 
AAGCAAAGAG 
AATGTTTGCT 
TTAAAAATAA 
GCATGTAACC 
GTTAACCGCT 
TTTTAAATCT 
ATTTTACCAA 
ATAGATTGTT 
GGGGATTTAT 
ACTCTAATTT 
TTCTAATTTA 
TTCTATAGTG 
GTCTTTTCTA 



TTAGTTTTGC 
TCATGATAGT 
CCACAATATT 
GGTTAGTAGT 
GACGAGACTT 
AGCAAGCACC 
TAATTTTCTG 
TGTAGTTCAT 
GAGGAGGTTT 
CTTTGATTTA 
TGGCATAGAT 
CCTTAGGGTC 
TAAGAATGGA 
TGGGGAGTCT 
GATGCTACAG 
CAACATATAA 
AGTCAAGATC 
ATTGTGTCCT 
TTGTGTGTTT 
AATACATACT 
TCCTTACAAT 
CTAATTTTTA 
TTCATTTATC 
AAAATCTCAT 
AATAGAAAAT 
AAAGAAAAAA 
GAAAAGGTTT 
TCAATCCAAA 
GCTTCATCAA 
CATTGGTTTC 
TGATGACAAA 
GTTTTGAGAG 



TGATGATATT 
GGTTGATGAG 
TGATGCGGGT 
AAGTTCTCTT 
GACTACAAAC 
TCTCACTTGA 
GTGTTCAGTC 
TCCTCTAGAG 
GCTACTCAAA 
TGCGTTGAGC 
AAGTCATTAC 
TTCGGTTGGA 
AG T AAAAGAT 
TATTGATCGT 
TTGCAGAGGT 
GTTTGATTGT 
ATTGATTAGT 
ATACAAAGAA 
AAAGTAAATT 
TACGTAGGAA 
TCATACTTAT 
TTAGGCAGCA 
ATTCATTTTA 
TTTATCCGTT 
GCCATCTGGC 
CTTATACGAC 
ATTCATTACT 
GAAAAGCTTC 
TCTTAGGACT 
GTGTTTTTTC 
AAAAAAATGT 
TTTAAGGTTT 



ATGGTCTTCA 
TATGCCCATA 
CGAGGGAAGA 
CCCATTTGGT 
CTCTACTTGA 
GGTTGTCTAC 
TTCAGGCTTC 
TGGATCATCG 
AAAGGAAGGG 
CTAATATGGA 
CTTCTGCGCC 
CGTAAGCTGC 
GGAGTTAAGA 
AGTTGGAGAA 
TGTAGCAAGA 
GGACCAGATC 
ATATATACAT 
TATATATAAC 
GTGTCCTATA 
TCTTTTTGTT 
TAACTTACAT 
AGTCATTTAC 
GTTTGTAACT 
CGCATTAAAG 
AAATAAACAA 
TTTTCTTGCT 
TTCATGCAGA 
CAAAAACTGA 
GACTTTTCCA 
GAAAGGAGAC 
AATTGTTAGT 
ATGATCGGAA 
38 



TGGATGGTAC 
TTTCAGTTTT 
TGACTTTGGA 
ACCTTGGGCT 
CAAGATAAGG 
AACTTATGAA 
CAAAAAATGT 
CTTGATGCAA 
GGCTTGGGGT 
GGTTATATAC 
AAGAATCATT 
TCAAGCTTTG 
CACGATCCTA 
AGGGGAACAT 
GGTCACTGGT 
ATAGCTAAAG 
ATTGTATTGC 
GATCATTGAT 
CAAAGAATAT 
TTCTTTTTTC 
ATTATGTTTT 
AAAGTAAAAA 
TTTTATTAGA 
TTAAATAGAT 
CGGACACGAT 
TAGAAGTCTT 
GAGATAACAT 
CTCTGCTTCA 
AWYCAATATG 
AACTATCTTT 
TGCAAAAGAA 
CTTAGAGTKT 



TCTTGGTTCT 
TAACATCAAT 
AATAGGGGCC 
GCGCTAACCA 
TCTCGTTTTT 
CTCAGTTATA 
TTTTAGACAT 
CTAAAGCAAA 
TCCGCGTATG 
CATGTCGGGC 
TTGGGATATC 
CCCACAAGCC 
GTCGGATACT 
ATGAATTGGG 
CAATCCGTCG 
ACCCGTCCGT 
ATGAAAAGTG 
TAGTATATAT 
CTTTGTGGAG 
ACAACGTAAG 
CTTTTAAATA 
ATTTCTCCAT 
TTTTGATCAA 
TGCTGACATA 
TTTAAACTAA 
TGCATTGTTA 
ATCATCGCGT 
TGAAACACTC 
CGGAACTATC 
TTAAAAGCTT 
AAGTACAATA 
AAATTTAAAC 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
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TATTTTGTTA ATTTTTGGAC TGATAACAGT TTTTTTTTGA AAATATTGAA ACGTTGTTTA 28 80 

CCTAATGTAA CATGTTATTC TACTTAAATT ACTTTATATT TTAATAACAT ATAATATTGA 2940 

ATAGGATATC ATAGGATATT ATTACGTAAT AATATCCTAT GGTGTCATTT TATAAGTTAG 3000 

CACAAGCTTG TTTTAACTTA TAAAATGATT CTCCCTCCAT ATAAAAAAGT TTGATTTTAT 30 60 

AGAATGTTTA TACCGATTAA AAAAATAATA ATGCTTAGTT AT AAA TT AC T ATT T ATT CAT 312 0 

GCTAAACTAT TTCTCGTAAC TATTAACCAA TAGTAATTCA TCAAATTTTA AAATTCTCAA 3180 

TTAATTGATT. CTTGAAATTC ATAACCTTTT AATATTGATT GATAAAAATA TACATAAACT 3 240 

CAATCTTTTT AATACAAAAA AACTTTAAAA AATCAATTTT TCTGATTCGG AGG GAG TATA 33 0 0 

TGTTATTGCT TAGAATCACA GATTCATATC AGGATTGGAA AATTTTAAAG C C AG TG CAT A 3 36 0 

TCAGTAGTCA AAATTGGTAA ATGATATACG AAGGCGGTAC AAAATTAGGT ATACTGAAGA 3 420 

TAGAAGAACA CAAAAGTAGA TCGGTCACCT AGAGTTTTTC AATTTAAACT GCGTATTAGT 3 48 0 

GTTTGGAAAA AAAAAACAAA GTGTATACAA TGTCAATCGG TGATCTTTTT T TTTTTTTTT 3 540 

TTTTTTTTTT TTCTTTTTGG ATAAATCTCA ATGGGTGATC TATTGACTGT TTCTCTACGT 3 600 
CACTATTTTA CTTACGTCAT AGATGTGGCG GCATATATTC TTCAGGACTT TTCAGCCATA 
GGCAAGAGTG ATAGAGATAC TCATATGCAT GAAACACTAA GAAACAAATA ATTCTTG AC T 
TTTTTTCTTT TATTTGAAAA TTGACTGTAG AT AT AAAC TT TTATTTTTTC TGACTGTAAA 
TATAATCTTA ATTGCCAAAC TGTCCGATAC GATTTTTCTG TATTATTTAC AGGAAGATAT 

CTTCACAACA TTTTGAATGA AGTAATATAT GAAATTCAAA TTTGAAATAG AAGACTTAAA 3900 

TTAGAATCAT GAAGAAAAAA AAACACAAAA CAACTGAATG ACATGAAACA ACTATATACA 3 960 
ATGTTTCTTA ATAAACTTCA TTTAGGGTAT ACTTACATAT ATACTAAAAA AATATATCAA 
C AATG GC AAA GCTACCGATA CGAAACAATA TTAGGAAAAA TGTGTGTAAG GACAAGATTG 

ACAAAAAAAT AGTT ACG AAA ACAACTTCTA TTCATTTGGA CAATTGCAAT GAATATTACT 4140 

AAAAT AC TC A CACATGGACC ATGTATTTAC AAAAACGTGA GATCTATAGT TAACAAAAAA 42 00 

AAAAAGAAAA AAATAGTTTT CAAATCTCTA TATAAGCGAT GTTTACGAAC CCCAAAATCA 4260 

TAACACAACA ATAACCATTA TCAACTTAGA AAAATGAATT TTACTGGCTA TTCTCGATTT 4320 

TTAATCGTCT TTGTAGCTCT TGTAGGTGCT CTTGTTCTTC CCTCGAAAGC TCAAGATAGC 43 80 

CCACAAGATT ATCTAAGGGT TCACAACCAG GCACGAGGAG CGGTAGGCGT AGGTCCCATG 4 4 40 

CAGTGGGACG AGAGGGTTGC AGCCTATGCT CGGAGCTACG CAGAACAACT AAGAGGCAAC 4500 
TGCAG 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

39 
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(C) STRANDEDNESS: sin „ lD 
«D; TOPOLOGY; linear"^ 6 

(iij MOLECULE TYPF- ^f-u 

(A, DESCR^ION ^^-fc-cid 
SXtl C " 01 ^nucleotide prilner 

(iii) HYPOTHETICAL: NO 

Uv) ANTI-SENSE: NO 

<Xi ' SE ° UENCE ASCRIPTION: SEQ ID NO , 2 . 
AAGAGCACCT ACAAGAGCTA CAAAGACG 

(2) INFORMATION FOR SEQ ID NO: 3 : 28 

(i) SEQUENCE CHARACTERISTICS - 
<A) LENGTH: 17 base 

(B) TYPir. „ i • Pairs 
io; type.- nucleic acid 

(C) STRANDEDNESS- s i„L 

(D) TOPOLOGY: l ine ar 9 

<ii) MOLECULE TYPE- «t-k 

««. .-^^ „ 0 " 0U — «w .... 

(iv) ANTI -SENSE : NO 

CXI) SEQUENCE DESCRIPTION : SEQ I D NO : 3 • 
GGCAAAGCTA CCGATAC 

(2) INFORMATION FOR SEQ ID NO: 4: 17 

(i) SEQUENCE CHARACTERISTICS • 
A LENGTH: 20 base p2rs 
) D> T *PE: nucleic acid 
(CJ STRANDEDNESS sinal. 
(D). TOPOLOGY: linear 9 

<ii) MOLECULE TYPE • 
«Ui, HYPOTHETICAL : NO " 01i ™""- — - 

<xi> SEQUENCE DESCRI PTIO n : SEQ „ NO:4 . 
GGACGTAACA TTTTTCTAAG 

' 2 ) . INFORMATION FOR SEQ ID NO : 5 : 20 

<i) SEQUENCE CHARACTERISTICS • 
LENGTH: 20 base pairs 

40 



WO 98/03536 




PCT/US97/12626 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide primer 1c" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CTTAGAAAAA TGTTACGTCC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide primer Id" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
TTACGCTGCG ATGGATC 

( 2 ) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 0 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide primer 

Nl, 076" 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ACCGCTCGAG AATTTTTCTG ATTCGGAGGG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE • 

(A, D E SCRX Y p P ? ro °r h ^c C -^Oi aCid 
PR1R C " 0 ^ffonucleotide primer 

(iii) HYPOTHETICAL: NO 



) 



<*i> SEQUENCE DE SCRI PTX0N: S£Q rD 
TATTTGTTTC TTAGTGTTTC ATGC 

(2) INFORMATION FOR SEQ ID NO: 9 : 24 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH : 30 base pair, 
<B) TYPE : nucieic acid 

(C) STRANDEDNESS: simxl* 

(D) TOPOLOGY: linear ^ 

(ii) MOLECULE TYPP- ^*.u 
W ' 81 Oligonucleotide primer 

(iii-) HYPOTHETICAL: NO 

<Xi) SEQUENCE DESCRIPTION: SEQ » NO . 9 . 
ACCGCTCGAG ATAAATCTCA ATGGGTGATC 

(2) INFORMATION FOR SEQ ID NO : 1 0 : 30 

(i) SEQUENCE CHARACTERISTICS- 
J LENGTH: 30 base 
iu TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPF- ~<-u 

N?04 .. (A) DESCrSxon h /Ls n c C - le ^ aCid 

° ~ 01l S°nucleotide primer 

fiii) HYPOTHETICAL: NO 



fx*) SEQUENCE DESCRIPTION: SEQ ID No . 10 - 
ACCGCTCGAG TTCTTCAGGA CTTTTCAGCC 

(2) INFORMATION FOR SEQ ID NO: 11 : 30 
(i) SEQUENCE CHARACTERISTICS • 

„ IYp E: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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* 



(ii) MOLECULE 
(A) DESl 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GCTCTAGAGG GAAAAAAAAA AAAAAAAAAA AAAAAA 
(2) INFORMATION FOR SEQ ID NO: 12: 

<n SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID 
GCTCTAGAGC AATCTCAATG GGTGATCTAT TG 
(2) INFORMATION FOR SEQ ID N0:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 4 base pairs 

(B) TYPE: nucleic acid 



(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc ~ "primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTCCTAGAGC TATCCAAAAA GAAAAAAAAA AAAA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer ] 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
GTCCTAGATA CATTGAGATT TATCCAAAAA G 



(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : sinale 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

ASCRIPTION: /desc = - pr i^ LS4- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15 : 
GTCCTAGAAT ATAGATCACC CATTGAGATT T 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION ; /desc --^pri^ LSS _, 



31 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTCCTAGATT GAAACAGTCA ATAGATCACC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 31 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

^CRIPTION: /«»•« = "Primed LS6- 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:17 : 
GTCCTAGAGG GTGACGTAGA GAAACAGTCA A 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 31 base pai rs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

. (A) DESCRIPTION : / desc = *p r S tS7 _. 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GTCCTAGAAA AAGTAAAATA GTGACGTAGA G 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTCCTAGATT TCTATGACGT AAGTAAAATA GTG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LSy- 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 

31 

GT C C TAG AC G TGCCGCCACA TCTATGACGT A 
(2) INFORMATION FOR SEQ ID NO: 21: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LS^+ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
GTCCTAGAAT GGTGATCTAT TGACTGTTTC TC 
(2) INFORMATION FOR SEQ ID NO: 22: 

45 



30 



WO 98/03536 

p CT/US97/12626 

fB) typp. « , ? ase Pairs 
(iij MOLECULE TYPP- o-u 

esc - 'Primer LS3+- 
SEQUENCE ^CHX PTION: SEQIDN0 . 22 . 

gtcctagaga tgactgtttc tctacgtcac ' ' 

(2) INFORMATION FOR SEQ lD NO:23 . 
< i > SEQUENCE CHARACTERISTICS • 

<c! lis- 

(D) TOPOLOGY^ linear 

(iij MOLECULE TYPE - 

<*> oBcwmoSf^:^ 0 acid 

aesc - Primer LS4+- 

' Xi) SEQUENCE D ^CRIPTX 0N: S EQIDno . 23 . 
GTCCTAGAAT TCTACGTCAC TATTTTACTT AC ' " 

<2) INFORMATION FOR SEQ I DN0:24: " 32 
(A) LENGTH: 34 base pa ' 
fD) TO POLOGY N ^ ne -^e 

fii) MOLECULE TYPP- rtt -u 

<a> descrJpJxo^ h / d r e ^ cl ^ C acid 

/aesc _ 'p rimer LS5+ ,. 

Cxi) SEQUENCE DESCRIPTION: S E Qi DNC , 24 . 
GTCCTAGATA TATTTTACTT ACGTCATAGA TGTG ' 

(2> IWPOR «ATION FOR SEQ IDNO:25 : 34 

U) S fWENCE CHARACTERISTICS • 

(A) LENGTH: 3 0 hast t*- ' 

(B) tvpi?. _ , r ase Pairs 
ri Imf nucleic acid 

(C) STRANDEDNESS • sin „i 
W TOPOLOGY: linear 

<ii) MOLECULE TYPE • ot-h- 

(A) n ES CRIPTIONf h /Lsc C -^ C aCld 

/aesc - -primer LS6+- 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-25: 
GTCCTAGAGA ACGTCATAGA TGTGGCGGCA 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
{A} LENGTH: 3 2 base pairs 
(B| TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LS7 + 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTCCTAGATG TGTGGCGGCA TATATTCTTC AG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

( (A) DESCRIPTION: /desc = "primer LS8+ 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GTCCTAGATT TATATTCTTC AGGACTTTTC AGC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LS9h 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTCCTAGAAT AGGACTTTTC AGCCATAGGC 
(2> INFORMATION FOR SEQ ID NO: 29: 



<*) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE- ni-h=v _ ■, . 

(A, D ES C^PTION: h ^e"c C -"" 3Cid 

/aesc - primer LS10- 

(Xi, SEQUENCE DESCRIPTION : SEQ ID NO:29: 
GTCCTAGAAC GAAGAATATA TGCCGCCAC 
(2) INFORMATION FOR SEQ 10 no: 30: 

U) SEQUENCE CHARACTERISTICS- 

(B) TYPE: nucleic acid 
C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE • ot-ho, „ , • 

(A) DESCRIPTION /desc"-*^ ? eid 

/aesc - primer LS11-" 

(xi, SEQUENCE DESCRIPTION: SEQ ID N O:30: 
GTCCTAGAAC GAAAAGTCCT GAAGAATATA TG 
(2) INFORMATION FOR SEQ ID NO : 3 1 : 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 29 base pairs 
B TYPE: nucleic ac?d 

(C) STRANDEDNESS : sinale 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc - ^ * 

/aesc - primer LS12-" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31 : 
GTCCTAGAAA GCCTATGGCT GAAAAGTCC 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH ; 37 base pairs 
B, TYPE: nucleic acid 
(C, STRANDEDNESS: sinole 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE • n m»^ , • 

(A, DESCRIp?i 0N th ^ e ^ C . le - r ac ^ 

/aesc - primer LS102" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CTAGAGGAGC CATAGGCAAG AG TG AT AG AG ATACTCA 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LS10J 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
TATGAGTATC TCTATCACTC TTGC CTATGG CTCCT 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LSll^ 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
CTAGATGAAG AGTGATAGAG ATACTCA 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LS113 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 
TATGAGTATC TCTATCACTC TTCAT 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "primer LS122" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
C TAG AT AG AG ATACTCA 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 15 base pairs 
fB> TYPE: nucleic acid 
■(C) STRANDEDNESS ; single 
<D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LS123 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37 : 
TATGAGTATC TCTAT 

15 

(2) INFORMATION FOR SEQ ID NO : 3 8 : 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION; /desc = "primer LS13-- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 : 
GGAATTCCAT ATGCCAGAAG TCTTCACTCT TGCCTATGGC TGAAAAG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION; /desc = "primer Pl-» 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 39; 
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ATTTACAGTC AGAAAAAATA AAAG 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



PCT/US97/12626 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LMPCR2 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 40: 

11 

GAATTCAGAT C 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LMPCKJ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

21 

TGACCCGGGA GATCTGAATT C 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

( c ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer P2-" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
AGTTTATATC TACAGTCAAT TTTCAAA 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer P3-" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GTTTATATCT ACAGTCAATT TTCAAATAAA AG 
(2) INFORMATION FOR SEQ ID NO: 44: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

{ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = -primer P41+" 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO:44: 
CTTTTTTTTT TTTTTTTTTT TTTTTTTTC 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer LMPCR1 " 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer P52+" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TTTTTTTTTT TTTTTCTTTT TGGATAAATC 



AGTTACTAGT GAGATCTGAA TTC 
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(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "primer P53 + " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

TTTTTTTTTT TTTTTCTTTT TGGATAAATC TC 

(2) INFORMATION FOR SEQ ID NO: 48: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 base pairs 
{ B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

( D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "recognition site of the yeast 
transcription factor GCN4 M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
TGACTG 

(2) INFORMATION FOR SEQ ID NO: 49: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : s ingle 
[ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " recognition site of the bZIP 
transcription factor CREB" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CTACGTCA 

(2) INFORMATION FOR SEQ ID NO: 50: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: other nucleic acid . 

(A) DESCRIPTION: /desc = M recognition site for t>zir 
transcription factors" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
ACGTCA 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY : 1 inear 



kB" 



Mi) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = • consensus recognition sequence of NF 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

11 

GGGACTTTTC C 



54 



WO 98/03536 




# 



PCT/US97/12626 . 



What Is Claimed Is: 

An isolated DNA molecule comprising a nucleotide sequence selected from the 
following group: 

a) a full-length chemically inducible promoter fragment comprising nucleotides 1 
through 4258 of SEQ ED NO: 1 ; 

b) an 8 1 5-bp long chemically inducible promoter fragment comprising 
nucleotides 3444 through 4258 of SEQ ID NO: I; and 

c) a 698-bp long chemically inducible promoter fragment comprising nucleotides 
3561 through 4258 of SEQ ID NO: 1. 

The.isoiated DNA molecule of claim 1 , comprising the full-length promoter fragment 
comprising nucleotides 1 through 4258 of SEQ ID NO: 1. 

The isolated DNA molecule of claim 1, comprising the 8! 5-bp long PR-1 promoter 
fragment comprising nucleotides 3444 through 4258 of SEQ ID NO: 1. 

The isolated DNA molecule of claim 1, comprising the 698-bp long PR-1 promoter 
fragment comprising nucleotides 3561 through 4258 of SEQ ID NO: 1. 

A chimeric gene comprising the DNA molecule of claim 1 operatively linked to a 
coding sequence, wherein the DNA molecule regulates transcription of said coding 



sequence. 



The chimeric gene of claim 5, wherein said coding sequence encodes an enzyme. 



The chimeric gene of claim 6, wherein said enzyme is an assayable marker. 



The chimeric gene of claim 7 t wherein said assayable marker is GUS. 



A recombinant vector comprising the chimeric gene of claim 5. 
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10. A plant cell stably transformed with the vector of claim 9. 

1 1 . The plant cell of claim 10, which is a maize cell. 

12. The plant cell of claim 10, which is a wheat cell. 

13. The plant cell of claim 10, which is an Arabidopsis cell. 

14. An isolated DNA molecule involved in inducibility of a chemically inducible 
promoter selected from the following group: 

a) LS4 comprising nucleotides 3584 through 3593 of SEQ ID NO: 1 ; 

b) LS7 comprising nucleotides 3614 through 3623 of SEQ ID NO: 1 ; 

c) LS 10 comprising nucleotides 3644 through 3653 of SEQ ID NO: 1; and 



a region spanning LS7-LS10 and comprising nucleotides 3614 through 3653 of 
SEQ ID NO: 1 . 



1 5. The isolated DNA molecule of claim 1 4, wherein said DNA molecule comprises a 
negative regulatory element and comprises nucleotides 3584 through 3593 of SEQ ID NO: 1. 

16. The isolated DNA molecule of claim 14, wherein said DNA molecule comprises an 
element necessary for inducibility of said chemically inducible promoter and comprises 
nucleotides 3614 through 3623 of SEQ ID NO: 1 . 



17. A chemically inducible hybrid promoter comprising the DNA molecule of claim 16 
operatively linked to a minimal promoter fragment, whereby said DNA molecule confers 
inducibility to said minimal promoter fragment. 

18. The isolated DNA molecule of claim 14, wherein said DNA molecule comprises an 
element necessary for inducibility of said chemically inducible promoter and comprises 
nucleotides 3644 through 3653 of SEQ ID NO: 1. 
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1 9. A chemically inducible hybrid promoter comprising the DN A molecule of claim 1 8 
operatively linked to a minimal promoter fragment, whereby said DNA molecule confers 
inducibility to said minimal promoter fragment. 

20. The isolated DNA molecule of claim 1 4, wherein said DNA molecule comprises 
elements necessary for inducibility of said chemically inducible promoter and comprises 
nucleotides 3614 through 3653 of SEQ ID NO: 1 . 

21. A chemically inducible hybrid promoter comprising the DNA molecule of claim 20 
operatively linked to a minimal promoter fragment, whereby said DNA molecule confers 
inducibility to said minimal promoter fragment. 

22. A method of isolating transcriptional regulatory proteins, comprising the steps of: 

a) screening a cDNA expression library with an oligomer comprising the DNA 
molecule of claim 14; 

b) detecting clones expressing proteins that bind to said DNA molecule; and 

c) isolating said clones expressing said proteins. 

23. A method of isolating transcriptional regulatory proteins using a yeast one-hybrid 

system, comprising the steps of: 

a) fusing the DNA molecule of claim 14 upstream of a minimal promoter and a 
reporter gene coding sequence to construct bait: 

b) transforming the bait into yeast; 

c) transforming said yeast with a cDNA expression library fused to a yeast 
activation domain; 

d) activating transcription of the reporter gene upon specific interaction between 
the bait and a transcriptional regulatory protein; 

e) detecting a clone expressing said regulatory protein; and 

f) isolating said clone expressing said protein. 
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